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Abstract 


This report documents the GMAO’s Goddard Earth Observing System sea ice and ocean data 
assimilation systems (GEOS iODAS) and their evolution from the first reanalysis test, through 
the implementation that was used to initialize the GMAO decadal forecasts, and to the current 
system that is used to initialize the GMAO seasonal forecasts. The iODAS assimilates a wide 
range of observations into the ocean and sea ice components: in-situ temperature and salinity 
profiles, sea level anomalies from satellite altimetry, analyzed SST, and sea-ice concentration. 
The climatological sea surface salinity is used to constrain the surface salinity prior to the Argo 
years. Climatological temperature and salinity gridded data sets from the 2009 version of the 
World Ocean Atlas (WOA09) are used to help constrain the analysis in data sparse areas. The 
latest analysis, GEOS ODAS5.2, is diagnosed through detailed studies of the statistics of the 
innovations and analysis departures, comparisons with independent data, and integrated values 
such as volume transport. Finally, the climatologies of temperature and salinity fields from the 
Argo era, 2002-2011, are presented and compared with the WOA09. 
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1 Introduction 


The Goddard Earth Observing System integrated Ocean Data Assimilation System (GEOS 
iODAS) described in this document is a system for both ocean and sea-ice data assimilation. It is 
integrated within the broader GEOS model and data assimilation system using the Earth System 
Modeling Framework (ESMF). By using an object-oriented framework, the analysis systems are 
model-independent, but have been implemented and tuned for the Modular Ocean Model Version 
4.1 (MOM4.1) developed by the NOAA Geophysical Fluid Dynamics Laboratory (GFDL) and 
the CICE model developed by Los Alamos National Laboratory. 

The primary performance driver of the GEOS iODAS products (fields of temperature, salinity, 
currents, sea surface height, sea ice thickness and concentration) is the initialization of short-term 
climate forecasts. However, the analyses should also be climate-quality to support studies of 
climate variability and to serve as the ocean component of an Integrated Earth System Analysis 
(IESA). Since climate forecasts require hindcasts for forecast calibration, a byproduct of the 
ocean and sea-ice reanalysis is a retrospective estimate of the state of the ocean and sea-ice, and 
of the atmosphere when the analysis is conducted in the GEOS coupled atmosphere ocean general 
circulation model (AOGCM). 

Satellites provide measurements of the ocean surface or an integrated column, not profiles as for 
the atmosphere. For altimeter data in particular, full use of the data requires the assimilation to 
interpret the signal as temperature, salinity and current signals below the surface. Until the advent 
of Argo floats early this century, most of the in-situ ocean data have been limited to temperature 
profiles. In order to maintain the ocean’s water masses in the analysis, salinity corrections have 
to accompany temperature corrections. Thus most of the system development has been directed 
towards multivariate covariance modeling. The approach has been to rely on the statistics from 
ensembles, either static (ensemble optimal interpolation: EnOI) or dynamic (Ensemble Kalman 
Filter: EnKF), or a hybrid of static and dynamic covariances for the background covariances. 

Since the role of data assimilation is to find a compromise between the simplified representation 
of the model physics and observations prone to instrumental errors and biases, the estimated state 
is only optimum relative to the a priori hypothesis of model and observation errors. This report 
describes the parameters chosen during the evolution of the system through four implementations 
(GEOS ODAS4.0, ODAS5.0, ODAS5.1 and ODAS5.2) as we tested and evaluated the system in 
preparation for climate forecasts. Due to the lack of independent validating data, the sparsity of 
data prior to the altimeter period and completion of the Tropical Atmosphere Ocean/Triangle 
Trans-Ocean Buoy Network (TAO/TRITON) moored buoy array, and because evaluation of 
system performance by comparing with other ocean assimilation systems required long time 
series, each of these implementations was used for a reanalysis of the historical data. 

The report is organized as follows, section 2 describes the dynamical models used for the 
assimilation, section 3 presents the observations used in the assimilation, and section 4 describes 
aspects of the data assimilation methodology used. Section 5 discusses the assimilation set-up and 
its evolution, including parameters used for background and observations errors. Section 6 
presents diagnostics and results of the latest analyses, GEOS ODAS5.2. The report also includes 
a presentation of the climatologies of several fields over the more recent Argo period, 2002-2011. 


2 The GEOS-5 Coupled Model 

The model used in this report has two configurations, differing from one another only by the 
inclusion or exclusion of the atmospheric component, the Goddard Earth Observing System 
Model, Version 5 (GEOS-5) Atmospheric General Circulation Model (AGCM). 
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Figure 1: Northern Hemisphere view of the ocean tripolar grid. The blue dots indicate the 

center of the grid cell 

Configuration 1 (CM1) is a fully coupled model that integrates the Fortuna version of the GEOS- 
5 AGCM with the Catchment land surface model (LSM) (Rienecker et al. 2008; Molod et al. 
2012), the Modular Ocean Model version 4 (MOM4, Griffies et al. 2005) and the CICE sea ice 
model (Hunke and Lipscomb 2008). A skin layer interface serves as a coupler between the ocean 
and atmosphere and mediates the exchange of momentum, heat and fresh water at every time 
step. The skin layer includes the parameterization of the diurnal cycle in the near-surface ocean, 
and the thermodynamics of CICE. The atmospheric component includes a river runoff routing 
scheme. All components are coupled together using ESMF and use the atmospheric time step for 
the coupling interval. 



Figure 2: Ocean grid resolution (km), represented as the square root of the cell area. 


For the assimilation experiments reported here, the atmospheric model uses a Cartesian grid with 
a 1° x 1.25° horizontal resolution and 72 hybrid vertical levels with the upper most level at 0.01 
hPa. The nominal resolution of the ocean grid is V 2 0 , with a meridional equatorial refinement to 
%°. It is a regular Cartesian grid south of 65°N, and curvilinear north of 65°N, with two poles 
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located on land to eliminate the problem of vanishing cell area at the geographic North Pole 
(Figure 1). The resolution of the resulting tripolar grid is depicted in Figure 2, which shows the 
square root of cell areas in units of kilometers, with a minimum and maximum resolution of 15 
km and 52 km, respectively. The ocean topography, shown in Figure 3, is derived from the 
ET0P05 data set. 



Q 500 1«W 1500 2000 2500 3000 3500 4000 4500 5000 


Figure 3: Ocean grid topography (m). 


The CM1 framework has been used in support of the decadal prediction activities under the 
Coupled Model Intercomparison Project Phase 5 (CMIP5; Ham et al. 2012) as well as the GMAO 
seasonal forecasts that are contributed to the National Multi-Model Ensemble (NMME) project. 

The second configuration (CM2) is similar to CM1, except that the data-atmosphere module of 
GEOS-5 is used so that surface forcing from the atmosphere is prescribed. The skin layer is 
provided with specified hourly fields of: 

• 1 0-meter air temperature 

• 10-meter specific humidity 

• 10-meter winds 

• Sea level pressure 

• Surface absorbed long-wave radiation 

• Surface incoming short-wave flux 

• Precipitation (rain and snow) 

• River run-off. 

To couple the ocean and sea ice component of GEOS-5 with a realistic AGCM, a method in 
which the AGCM is constrained to GMAO’s Modem-Era Retrospective Analysis for Research 
and Application (MERRA; Rienecker et al. 2011) is used. It consists of replacing some of the 
variables of the AGCM with the background fields from MERRA. The fields that are replaced 
every six hours consist of: 

• Surface pressure 

• Pressure thickness 

• Zonal and meridional winds 

• Specific humidity 

• Ozone concentration 

• Potential temperature 
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• Dry temperature. 


3 The Observations 

The evolution of the ocean observing system from 1950 to the present, as used in the GEOS 
iODAS, is depicted in Figure 4. Data assimilated consist of the following: 

• Sea surface temperature (SST) 

o Reynolds (Reynolds et al. 2007) from 1982 to present, 
o CMIP5 SST (Hurrell et al. 2008) prior to 1982. 

• Temperature (7) and salinity (S) profiles from 

o expendable Bathythermographs (XBTs) and Conductivity Temperature Depth 
(CTD) sensors extracted from the EN3 data base (Ingleby and Huddleston 2007) 
with time-varying XBT corrections applied according to Levitus et al. (2009), 
o the tropical moored buoy array (McPhaden et al. 2010) - TAO/TRITON, 
PIRATA, and RAMA arrays, 

o Argo, with profiles from the Argo Global Data Assembly Center (GDAC, see 
http : / / www. us godae. or g/ ar go/ ar go . html ) . 

• Along-track sea level anomalies (SLA) from the Archiving, Validation and 
Interpretation of Satellite Oceanographic (AVISO) merged product from 1993 
onwards. 

The World Ocean Atlas 2009 (WOA09) gridded climatology (Antonov et al. 2010; Locamini et 
al. 2010) is also used to correct some of the model's biases, particularly prior to the Argo era. 

The ice component of the iODAS assimilates sea-ice concentration from the National Snow and 
Ice Data Center (NSIDC) from 1979 onwards and the CMIP5 sea ice concentration prior to 1979. 
The NSIDC product is based on passive microwave observations of ice concentration from the 
Nimbus-7 Scanning Multi-channel Microwave Radiometer (SMMR) and the Defense 
Meteorological Satellite Program (DMSP) Special Scanning Microwave Imager (SSM/I) and 
Special Scanning Microwave Imager/Sounder (SSMI/S). The products have a 25 km spatial 
resolution for both the north and south polar regions. Temporal resolution is every other day from 
October 1978 to July 1987 (SMMR), then daily from August 1987 to present (SSM/I). Ice 
concentrations from CMIP5 and Reynolds are used in areas that are not measured due to orbit 
inclination (poleward of 87.2° for SSM/I and 84.5° for SMMR). 



Figure 4: Evolution of the ocean observing system used in the iODAS. 


4 




Figure 5: Time series of the number of in-situ temperature profiles per month as a function 

of depth. 
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Figure 6: The horizontal distribution of in-situ observations in (a) 1975, (b) 1995, (c) 2003, 

(d) 2005, (e) 2007, and (f) 2010. 
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While the near-surface ocean temperature is relatively well observed during the period of interest, 
that at depth is not. This can be seen in Figure 5, which shows the evolution of the observing 
system for temperature from 1950 to present. While Figure 5 illustrates a significant increase in 
profiles to 2000 m from 2003 onwards because of Argo, Figure 6 shows that the horizontal 
coverage becomes global only from about 2005 onwards. These changes in the observing system 
can result in large changes in the analysis biases, impacting inferred climate variability. In order 
to alleviate some of these artifacts, the WOA09 climatology is assimilated to constrain the 
unobserved ocean towards realistic values. 


4 The Data Assimilation Methodology 

Data assimilation refers to optimization methods that seek the true state of a system by optimally 
combining observations with a physical representation of that system. For all methods used in this 
report, the analysis can be expressed as a linear regression in observation space: 

= xf + 'Z*° 1 p i r i , ( 4 . 1 ) 

where x a is the analysis, x f is the forecast or prior state, the r t are covariances between the state 
and the observations, also referred to as the representers (Kimeldorf and Wahba 1970; Bennett 
1992), N () is the number of observations and the /? z are the representer coefficients. The /?, are 
determined by solving linear system 

( HPH t + R)p = y — Hx f = h, (4.2) 

where H is a linear observation operator, the i-th column of PH T is the i-th representer r h y is the 
vector containing the N 0 observations, R is an observation error-covariance matrix, and h is the 
model-data misfit. 

The iODAS is a sequential ensemble assimilation software system that includes a wide range of 
algorithm implementations, from simple optimal interpolation (OI: Eliassen 1954) to more 
expensive ensemble methods such as ensemble Kalman filtering or particle filtering. The 
reanalysis applications presented here use the ensemble optimal interpolation (e.g., Oke et al. 
2010, Wan et al. 2010) implementation where the time dependency of the covariances is 
neglected and P is estimated from the statistics of model run histories or from combinations of 
model histories. EnOI methods are often competitive with the EnKF because they make up for 
the performance degradation due to neglecting the forecast-error evolution by computing their 
statistics from a much larger ensemble. Keppenne et al. (2013) introduces an additional approach 
to estimating the background error covariances - the spatial approximation of forecast errors 
(SAFE) technique. This approach is used in the GEOS ODAS for the assimilation of surface 
observations such as SST and sea surface salinity (SSS) (see Table 1 below). 


4.1 Covariance modeling and localization 

In the Optimal Interpolation implementation, the covariances are assumed to have a Gaussian 
shape, approximated by the function given by C 0 (z) = C 0 as a special case of equation 

(4.10) of Gaspari and Cohn (1999). Dependencies on the background flow are obtained through 
the application of a localization function that uses Euclidean distance in T, S and density space. 
For example, a continuous form of the i-th representer can be written as 


r i (x, y, z, t) = Aa t C 0 (max + 


ly;-yl _j_ \zj-z\ 




(4.3) 


where {x i} y [} z i} tj) is the space-time position of an observation, a t is an estimate of the 
background-error variance at the location of the observation, u is T, S or density, and L x etc. are 
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the vanishing-correlation length scales. The parameter X is an inflation factor that is determined 
by 

; 2 \\diag(R)\\ 

7 \\diag(HPH T )\\ 9 (4.4) 

where y is prescribed and represents the target ratio of background error standard deviation to 
observational error standard deviation and where || || represents the L 2 vector norm. Finally, cp is a 
smooth, horizontally varying field that imposes the constraint that a representer does not cross 
major land boundaries; for example, a representer corresponding to an observation in the Gulf of 
Mexico will not influence the western Pacific. 

The SAFE implementation is similar to OI, except that the amplitude a t of the Gaussian 
covariances in (4.3) and cross-covariances are estimated from the model state itself. The 
approach is outlined here. A more complete description can be found in Keppenne et al. (2013). 

For simplicity of notation, the model state is split according as x = [v, w], where v is an observed 
model field and w is unobserved. The background error covariances are 



Then the analysis state update is split according to 

v a = yf + Av = yf + P vv H t [HP vv H t + /?] _1 [y - H(y 7 )], (4.5a) 

p wv 

w ijk = w iik+-^ Av ’ i = 1. ■■■»/;/ = 1 (4.5b) 

where /, J and K denote the number of grid cells along the x, y, and z space dimensions, 
respectively. 

The first step is to estimate the background-error variance of the observed field with 

<rlv = diag(P vv ) = 0([u - 0(u)] 2 ), ( 4 - 6 ) 

where 0 is a 3D local averaging operator. The iODAS implementations presented here used 
Gaussian smoothing, which was found to give better results than simple averaging with uniform 
weights. The size of the regions over which the 0 smoothing operator is applied is of little 
consequence, except close to continental boundaries (Keppenne et al. 2013). The variance 
estimate is rescaled such that X = 1 in (4.4). 

The covariance, co xl , of the background errors for v at locations (x\, y u z x ) and (x 2 , z 2 ) is 

estimated as 


^12 


= < 7 , 


v 1 v 1 (J v 2 v 2 C( 


(max j 


\x1-x2\_ _j_ Ivi-yzl + 


\Z!~Z 2 \ 

Lz 


|Vl— V 2 | )\ 
tv )) 


This formula ensures that a>i 2 is 0 if either Vi differs significantly from V 2 or the locations are very 
distant from each other. After estimating the background error variances and covariances, the 
update (4.5a) is applied. 

To update the unobserved variable, the local cross-field background error covariances of v and w 
in every grid cell are estimated as 

<?Sw = ®([ u - 0(v)][w - ©0*0]. 


The fields of unobserved variables are then updated according to equation (4.5b). 
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Unlike the SAFE or OI, the EnOI uses covariances estimated from an ensemble of model states. 
Spurious long-range covariances are filtered using localization in space, while dependencies on 
the background flow are obtained in the same manner as the OI method. 

The ocean and sea-ice analyses presented here from ODAS4.0 and ODAS5.0 used the EnOI 
while ODAS5.1 and ODAS5.2 used a configuration that combines OI, SAFE and EnOI. 


5 System Evolution 

5.1 GEOS ODAS4.0 

The ocean retrospective analysis using GEOS ODAS4.0 spans 52 years from 1960 to 2011. The 
reanalysis used the CM1 model framework described in Section 2 and the EnOI method to 
assimilate ocean and sea-ice observations while the atmosphere was constrained to MERRA. 
Prior to the Argo period, the model biases were corrected by assimilating 10% of the 
climatological profiles of temperature and salinity, randomly selected, from WOA09 every 10 
days. 

The ensemble used for estimating the background error covariances consisted of the 20 leading 
empirical orthogonal functions (EOFs) of an ensemble of 900 ocean states. To generate the 
ensemble, a 20-member ensemble of freely running GEOS-5 coupled simulations was integrated 
over a period of 3 years. Each ensemble member was started from the same initial condition with 
a perturbation proportional to the vertical gradient of the local T and S added to the initial 
condition for T and S , respectively. The spread (measured by total variance) across this ensemble 
saturated after about 420 days. After this spin up period, snapshots of the deviation of the 
ensemble about the mean were saved every 15 days and stored in a “super ensemble” of 900 
members. The 20 leading EOFs of this super ensemble were used to represent the background 
error covariances. 

This version of the analysis contributed to the Xue et al. (2012) study that compared upper ocean 
heat content from several analyses and used the ensemble of analyses to estimate the uncertainty 
in climate signals in the recent decades. GEOS ODAS4.0 products are available online at 
http://dp6.nccs.nasa.gov/las/getUI.do . 

5.2 GEOS ODAS5.0 

The GEOS ODAS5.0 analysis used the same CM1 framework as GEOS ODAS4.0, except that 
the observation errors were increased substantially. This was done because of some unrealistic 
features that were occasionally found in the tropical Atlantic in the earlier analysis. The 
ODAS5.0 used the same ensemble for the EnOI as was used for ODAS4.0. The analysis was 
conducted in three different streams, each spun up for at least two years and started from the 
WOA09 climatology. Stream 1 covered December 1959 to December 1980; stream 2 covered 
January 1981 to December 1997; and stream 3 covered January 1998 to December 2011, at which 
point it was discontinued. This version of the analysis was used to initialize the GEOS-5 decadal 
predictions of Ham et al. (2012). 

5.3 GEOS ODAS5.1 and ODAS5.2 

One of the purposes of the GEOS ODAS5.1 and ODAS5.2 ocean and sea-ice analyses is to 
provide consistent initial conditions for all the model components present in the CM1 framework 
as an initialization of seasonal hindcasts and forecasts with GEOS-5, from 1980 to present. CM1, 
replaying MERRA and assimilating SST, SSS and sea-ice observations, was used to provide all 
initial conditions except for the ocean. Because of discrepancies in the climatology of the 
hydrological cycle in MERRA (Reichle et al. 2011), a correction to the MERRA precipitation 
fields was applied during the replay. The correction is based on the v2.1 product from the Global 
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Precipitation and Climatology Project (GPCP) prior to September 2009 and the Climate 
Prediction Center (CPC) Merged Analysis of Precipitation (CMAP), rescaled to be consistent 
with the GPCP climatology, from September 2009 to the present. 

GEOS ODAS5.2 differs from ODAS5.1 in several ways: along-track altimeter data were 
assimilated and no climatological profiles of T and S were assimilated during the Argo period. 
ODAS5.2 was started at the beginning of the altimeter period in January 1993 from initial 
conditions provided by ODAS5.1. Due to time constraints, the assimilation of all ocean 
observations except sea-ice concentration was performed within the CM2 framework, forced with 
atmospheric fluxes obtained from the CM1 simulations. For ODAS5.1, the fluxes were obtained 
from the CM1 run described above, replaying MERRA but with a correction to the precipitation, 
assimilating SST, sea-ice concentration and SSS climatology. Unfortunately, one of the artifacts 
of the precipitation correction is the loss of the water budget closure, which resulted in the ocean 
component losing water. Hence, ODAS5.2 followed the same procedure, but with no correction 
to precipitation. 

Since the ODAS5.0 analysis did not agree with other analyses as well as ODAS4.0 did, the 
observation errors were adjusted again for GEOS ODAS5.1. An updated static ensemble was 
used for both ODAS5.1 and ODAS5.2. The updated ensemble was constructed by taking the 
leading EOFs of an ensemble of forecast anomalies (calculated relative to the climatological drift) 
obtained from GEOS-5 AOGCM seasonal hindcast tests initialized from the ODAS5.0 analysis. 

5.3.1 Assimilation set-up 

The observations are assimilated every 5 days, with observations in a 10-day window centered at 
the time of the analysis. Seven sequential analyses of different types of observations are 
performed in sequence. While the lc-th analysis uses the state from the (k-l)-th analysis for its 
background (first guess), the variance of the static ensemble used for the background error 
covariance stays the same. The sequence of analyses is as follows: 

1 . T(z) and S(z) from WOA09 at 1° resolution and from 0 to 4500m 

2. Sea surface salinity (SSS) from WOA09 

3. Sea level anomalies (SLA) 

4. SST 

5. T(z) from in-situ profiles 

6. S(z) from in-situ profiles 

7. Sea-ice concentration. 

The method and parameters used for each sequence of analysis are given in Table 1 for the 
parameters concerning the background error covariance P, and Table 2 for the observation error 
covariance matrix R. The justification for the choice of parameters is discussed in the following 
sections. 
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Table 1: Parameters used for the representer functions of equation (4.3) in ODAS5.1 and 5.2. The 
first column is the analysis sequence. Lines 1, 2, 4 and 7 give the parameters for the Gaussian 
shaped covariances. Lines 3, 5 and 6 are the parameters for the localization function used to 
correct the sampled covariance obtained from the static ensemble. L x and L y are in degrees of 
longitude and latitude, respectively. 
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Ly 

[deg] 

L z 

[m] 

L t 
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[°C] 

Ls 

y 
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1 

1 

1 
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1.0 
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S 
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X 

SLA and T 

4 

5 
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10 

2 


0.2 

X 


T 

5 

30 

15 
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10 

5 


0.2 


X 

T and 5 

6 

30 

15 
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10 


0.5 

0.2 


X 

s 

7 

8 

4 

25 

10 



1.0 


X 

Tand S 


Table 2: Scales used for the observational covariance function, R. The first column is the analysis 
sequence. L x and L y are defined as in Table 1. 



L y 

[deg] 

Ly 

[deg] 

L z 

[m] 

L t 

[day | 

1 

1 

1 

0 

0 

2 

2 

1 

0 

10 

3 

2 

1 

0 

10 

4 

2 

1 

0 

10 

5 

0 

0 

50 

10 

6 

0 

0 

50 

10 

7 

0 

0 

0 
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5.3.2 Climatological constraint 

Sequences 1 and 2 constitute a constraint by the WOA09 climatology of T and S. The reasons for 
using this constraint are two-fold: (1) to force the model to a reasonable climatology, and (2) to 
correct for errors in fresh water fluxes and runoff by assimilating SSS. 

Because of the relatively high resolution of the WOA09 climatology (1 degree horizontally and 
40 levels vertically), the spatial scales imposed are quite small (line 1 of Table 1) to reduce the 
computing cost of the first assimilation. The inflation factor was adjusted through trial and error 
until a realistic climatology was obtained, while the variance of the system was still acceptable. 
The observational error variance was set to be 100 times the relevant diagonal element of P. 

The parameters for the assimilation of SSS are given in the second line of Table 1. Only 10% of 
the SSS climatology data points, randomly selected, are included per assimilation window. The 


10 


horizontal scales and inflation factor were also adjusted through trial and error with a similar 
target as for the assimilation of climatological profiles. The vertical influence of the SSS 
observations is limited to 100 m or the depth at which the salinity difference is 0.5 from the 
background at the surface, whichever is the shallowest. Figure 7 shows the observation error (the 
square root of the diagonal of R ) used for SSS. It was estimated by calculating the mean 
difference between WOA09 and Argo surface salinity in 1 -degree bins over a period of 8 years 
(2003-2010). The observation error was reduced in regions of large runoff to 0.001. 



0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 0.22 


Figure 7: Estimation of the observation error for SSS. 


5.3.3 Along-track sea level anomalies 
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Figure 8: Example of an estimate of the observation error standard deviation for SLA (m). 

The SLA observations used in GEOS ODAS5.2 are along-track merged products obtained from 
AVISO, referenced to a 7-year mean (January 1993 to January 1999). Because of the high 
resolution of the AVISO product (adjacent SLA measurements are approximately 20 km apart), 
the along-track observations are high-pass filtered using a Gaussian kernel with a stencil size of 
21 and a decorrelation length scale of 100 km. The along-track variability over 200 km segments 
is used as a proxy for the errors of representativeness; a snapshot of that estimate is depicted in 
Figure 8 . No instrument error is imposed. 

The model SLA is given by 77 ' = 77 — 77 , where the mean sea level 77 , also referred to as the mean 
dynamic topography (hereafter, MDT), is the 7-year mean sea level of GEOS ODAS5.1 from 
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January 1993 to December 1999 (see Figure 9) and 77 is the total sea level from the barotropic 
component of the ocean model. 



Figure 9: The model mean sea surface height (m), January 199 3 -December 1999. 

Ta from sla and sst EN3 Ta 
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Figure 10: Upper ocean temperature anomaly (°C) at selected depths (25 m, 55 m, 105 m, 
and 225 m) for December 1997. Left column is the off-line analysis based only on SLA and 
SST; the right column is the EN3 objective analysis. 
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Within the EnOI framework used for the assimilation of the SLA, an increment in 77 is 
accompanied by an increment in the three-dimensional T field (see last column of line 3 of Table 
1). A poor estimate of the T could be caused by the wrong choice of MDT ( 77 ) or wrong cross- 
covariances between SLA and T. Whether the MDT chosen in the analysis is appropriate is still 
an active area of research and will not be discussed in this report. In order to separate the two 
sources of error, and only investigate the robustness of the cross-covariances, an objective 
(model-less) analysis was carried out, in which SLA are used as a predictor for T anomalies 
(hereafter, T') over the entire water column, therefore avoiding the need for an estimate of the 
MDT. To simplify the vertical projection, the daily gridded SLA from AVISO is used. Further, we 
assume that the errors are horizontally uncorrelated, an observation of SLA at ( x,y ) is only 
influencing a single water column of T'(z) at (x,y). Using this simplification, the problem of 
estimating the three-dimensional T r reduces to many independent one-dimensional vertical 
projections (as many as there are observations). For this purpose, the entire set of EOFs are used 
(286) for the computation of the covariance between 77 andT'(z) at location (x,y). Because of 
the relatively large size of the ensemble, localization of the vertically dependent covariance is not 
necessary. Figure 10 shows the monthly average of the upper ocean T' for December 1997, 
estimated by the vertical projection described above and compared to the EN3 objective analysis. 
The upper ocean structure of the El Nino phase, characterized by a large positive V in the eastern 
tropical Pacific and a large negative T' in the west, is well represented in the analysis, suggesting 
that the static ensemble is adequate for the projection of SLA onto the three-dimensional T' in the 
tropical Pacific. Similarities are observed in the extratropics as well, but the off-line analysis 
features small scales that are clearly not present in EN3. It is doubtful that these small-scale 
features are realistic; rather, they are artifacts of assuming that the errors are horizontally 
uncorrelated. 
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Figure 11: Average observation error for Reynolds SST (°C). The upper (lower) panel is the 

average for 2002-2011 (1982-2002). 
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5.3.4 Sea surface temperature 

The assimilation of SST is fourth in the sequence depicted in Table 1, correcting the upper 
analyzed ocean heat content that was a result of the earlier sequence of assimilations within the 
same time window. When assimilating CMIP5 data (prior to 1982), the observation errors are set 
to 0.5°C, globally. After 1982, when assimilating Reynolds SST, the observation errors are given 
by the error in the optimal interpolation used in the creation of the gridded SST retrieval. The 
mean of the error from 1982 to 2011 is given in Figure 11. The assimilation methodology uses the 
SAFE algorithm with Gaussian horizontal and vertical covariances. The vertical influence of SST 
is limited to the minimum of 500 m and the depth at which the temperature differs by 2°C from 
the temperature at 5 m. 

5.3.5 In-situ profiles of temperature and salinity 

The assimilation of in-situ profiles of temperature and salinity correspond to sequences 5 and 6 of 
Table 1. The observations are assimilated twice, in a multi-scale fashion, with the second 
assimilation being identical to the first except for the horizontal localization scales that are 50% 
of the scales used in the first assimilation. While both T and S analyses use the EnOI algorithm, 
only step 5, consisting of the analysis of T profiles, is multivariate (increments in T have 
corresponding increments in S ). Step 6 consists of the univariate assimilation of S profiles. Since 
most S profiles have a corresponding T profile, salinity is not used to correct temperature. 

Prior to being assimilated, both T and S are binned according to the model’s levels and any profile 
that contains an observation that is six or more standard deviations (where the standard deviation 
is taken to be the standard deviation of the interannual variability) away from the WOA09 
climatology is rejected. The observation error is modeled with an exponential decay written as 

cr(z) = a 0 exp (- £) + a b (5.1) 

where a t is the instrument error and the exponential decay term is a proxy for the errors of 
representation, parameterized with the e-folding depth scale z 0 = 500 m and amplitude a 0 . The 
parameters necessary to specify the observation error according to equation (5.1) are given in 
Table 3. 


Table 3: The parameters used for the observational variance function of 
equation (5.1) for observation types used in the GEOS ODAS5.2. 
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The covariances estimated from the static ensemble are localized using equation (4.3) and the 
parameterization given in lines 6 and 7 of Table 1. When assimilating T (or S ), v in equation (4.3) 
is T (or S). This localization choice allows for small vertical scales above the thermocline, where 
the extent of vertical scales is mostly limited by the localization in T (or S ), and large scales 
below the thermocline where the vertical influence of the observation is limited by L z . 

5.3.6 Sea-ice concentration 

The last sequence consists of the assimilation of sea-ice concentration, from NSIDC or CMIP5. 
Sea-ice concentration is defined as the fraction of ocean area covered by sea ice and assumed to 
be the same variable as the sum of the five types of fractional sea ice in the CICE model where 
the total fractional ice coverage is 

aice — 0 if there is no ice 

aice = 1 if there is no open water 

0< aice < 1 if there is both ice and open water. 

Since the total concentration, aice , is a diagnostic variable in CICE, one would need to project the 
observations onto the five types of ice concentrations. An implementation in which the observed 
ice fraction was decomposed into the five prognostic types of ice fraction using the same ratios as 
the background was found to be numerically unstable most of the time. Instead, T and S 
increments are computed using the covariances between aice , T and S from the static ensemble. 
Again, to account for the small size of the ensemble, and the resulting errors in the estimation of 
distant correlations, a localization procedure is used. The parameters of the localization are given 
in line 8 of Table 1. The observation error is prescribed as 0.05. 

6 Results and diagnostics 

6.1 Active tracers 

The advent of the Argo observing system has had a large impact on ocean analyses, so much so 
that the sharp changes in the climatology of the analysis post and prior to the Argo period is most 
often largely attributed to the higher resolution of these observations, rather than a real change in 
the climate of the ocean. This is illustrated in Figures 12a and b, which show the root mean 
square (RMS) observation-minus-forecast (OMF) for T and S , respectively. For both T and S , the 
most striking change occurs below 300 m, where the ocean was largely unobserved prior to Argo, 
while the improvement in the upper 300 m is more subtle and localized to areas of relatively 
small variability. 

When looking at the time series of T averaged over the upper 300 m, a useful proxy for the heat 
content in the upper 300 m, a similar artifact is observed, where the analysis (red line in the upper 
panel of Figure 13) converges to the EN3 objective analysis, suggesting a reduction of the 
uncertainty for the state of the global T during the period of maturity of the Argo deployment. 
However, this is not true when looking at T averaged over the upper ocean in the tropical band, 
30°S-30°N (left-hand panel of Figure 14), where the difference between the two analyses remains 
similar throughout the period shown. This is due to the availability of data from the tropical 
Pacific moorings, especially the TAO/TRITON array, which constrains both the analysis and EN3 
and reduces the uncertainty in T. The global mean salinity from the GEOS analysis is closer to the 
WOA09 climatology than to the salinity in EN3. The difference from EN3 is about 0.02. The 
agreement between the GEOS analysis and EN3 is much closer in the tropical band (Figure 14). 
Interestingly after 1999, there is an offset between the two of about 0.01. 
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Figure 12a: The RMS of OMF for temperature (°C). The left columns are for the 1993-2005 
period, the right columns for 2006-2011. The upper panels are for the upper 300 m; the 
lower panels for 300-1000 m depths. 


Figure 12b: As for Figure 12a, but for salinity. 
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Figure 13: Left: Global mean temperature (°C) averaged over the upper 300 m. The red line 
is for the GEOS ODAS5.2; the dashed grey line is for EN 3, the dashed blue line is from 
WOA09. Right: Salinity averaged over the upper 300 m. 
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Figure 14: As for Figure 13, but for the tropical band, 30°S-30°N. 
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6.2 Statistics from the assimilation process 

To interpret the fit of the analysis to the observations, we first look at the global statistics of the 
reduced penalty functional (see equation 6.1 below). Following Bennett et al. (1998) we can use 
our data assimilation system to test an hypothesis, in which the null hypothesis is based mainly of 
estimates of observation errors, errors of representation and model errors. The residuals, or the 
posterior value of the cost function, from the linear regression in equation 4.1, are written as 

J = h T (HPlf r + Rr 1 h, (6.1) 

where h is the innovation (or OMF). / follows a x 2 distribution with the number of degrees of 
freedom equal to the number of observations processed in the assimilation. In the limit of large 
N 0 , the/ 2 distribution is approximated with a normal distribution with first moment E(xn 0 ) = 
N 0 and second moment var(xN 0 ) — 2iV 0 . The test is summarized as follows: 

If / < N 0 , then the value chosen for y in equation (4.4) was too large and the model over-fits 
the observation. 

If J > N 0 , then y should have been larger. 


Time series of the normalized value of / for each observation type are shown in Figure 15. 
Because of the normalization, the expected value of / is 1. 
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Figure 15: Normalized cost function, j, for each sequence of assimilation. 

The black line in Figure 15 is the ] corresponding to the assimilation of WOA09 SSS, it has a 
mean of 0.19, indicating that the observation errors are too small or the background error variance 
estimated from SAFE is too large. The same is true for the assimilation of SLA (red line), which 
has a mean value of 0.29 for /, indicating that the estimated observation and representation errors 
are too small, or the variance of the static ensemble is too large. The yellow line of Figure 15, 
corresponding to the assimilation of SST, has a mean well above 1 and sharp changes in 2003 and 
2011, corresponding to a significant change in the observing system and the instrument error 
associated with it (see Figure 11). The total error estimate for the SST observations do not include 
errors of representation, but rather the error of the OI algorithm used to grid the Reynolds SST 
field, providing a lower bound estimate for the diagonal of R. This implies that the model error 
for SST was underestimated, and the surface variance of T in the static ensemble underestimates 
the background error variance. The purple line of Figure 15 corresponds to the evolution of / for 
the assimilation of ice fraction over both polar regions. It includes a strong seasonal cycle with a 
minimum close to 1 in April and a maximum between 10 and 15 in September, indicating that the 
observational error and the covariance model should have included a seasonal component. 

The/ 2 statistic for the in-situ profiles of T 7 , depicted as the blue line in Figure 15, is well over 
unity for the entire analysis period, suggesting that our background covariance model has too 
little amplitude and our estimate of representation error was possibly too large. Finally, the cyan 
line of Figure 15, representing the evolution of / for the assimilation of in-situ S profiles, starts 
off significantly larger than 1 but slowly converges to its expected value during the Argo period, 
indicating that our models for the representation error and background error are adequate during 
the more recent years. 

No attempt was made to adjust y so that the normalized / would be close to its expected value of 
1; instead, the ratio of observation error to background error was adjusted so that the forecast 
would be numerically stable. 

The temporal evolution of the global RMS of OMFs and OMAs (observation minus analysis, also 
referred to as analysis departures) for the surface observations, SST, SSS, SLA and aice, is shown 
in Figure 16. The global RMS of OMFs corresponding to the in-situ observations of T and S are 
analyzed in the time-depth plane and shown in Figure 17. As expected, the larger errors for T and 
S are found within the thermocline. 
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Figure 16a: The solid blue line is the global RMS of OMFs for SST (°C). The mean RMS is 
0. 72°C and the standard deviation is 0.06°C. The dashed blue line is the RMS of OMA. The 
grey line is the observation count per assimilation window (right-hand ordinate). 



Figure 16b: As for figure 16a, but for SSS. The mean RMS is 0.26 and the standard deviation 

is 0. 05. 
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Figure 16c: As for figure 16a, but for SLA (m). The blue lines correspond to the 
TOPEX/Poseidon period; the red lines are for the Jason- 1 period and green for Jason-2. 
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Figure 16d: As for figure 16a, but ice fraction over the Arctic. 
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Figure 16e: As for figure 16a, but ice fraction over the Antarctic. 
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Figure 1 7 a: Time series of global RMS of OMF for temperature (°C) as a function of depth. 
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Figure 1 7b: Time series of global RMS of OMF for salinity as a function of depth. 


Summaries of the mean and RMS statistics from the innovations and analysis departures averaged 
as a function of depth for 2002 to 2011 are presented in Figures 18 and 19 for the tropics (30°S to 
30°N) and extratropics (60°S to 30°S and 30°N to 50°N) for CTDs and Argo. 

For CTDs and Argo, the RMS departures are largest in the northern extratropics for both T and S, 
with the background and analysis being biased warm relative to the CTD observations throughout 
the upper 1800 m. This is also the case over most of the upper 1800 m in the southern 
extratropics. The RMS errors for temperature relative to Argo in the upper 1000 m and salinity in 
the upper 200 m in the northern extratropics are smaller than those relative to CTD observations. 
This is most likely due to the much larger number of Argo observations that are then more 
effective in constraining the analysis. The structure and magnitude of the RMS errors in the 
tropics and southern extratropics are similar relative to both CTD and Argo observations. In 
general, both background and analysis have a very low bias relative to Argo. 
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Figure 18: Mean (dashed line) and RMS (solid line) of the OMF (red) and OMA (blue) for 
CTD temperature (left) and salinity (right), in the southern extratropics (upper), tropics 
(middle) and northern extratropics (lower). The statistics are calculated for 2002-2011 as a 

function of depth. 
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Figure 19: As for Figure 18, but for Argo. 
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6.3 Sea-level 


6.3.1 Comparison with altimeter and tide gauge data 

Not surprisingly, the comparison of sea level from the analyses with observations (Figure 20) 
indicates that the assimilation of along-track SLA data improves the analysis agreement with the 
gridded AVISO product. Even with assimilation, there are large discrepancies in regions of strong 
currents and large variability. The high agreement of ODAS5.1 with the altimeter data in the 
equatorial Pacific comes from the high quality of data from the TAO moorings. Interestingly, the 
ODAS5.1 agrees with the altimeter data better in the Indian Ocean than the Atlantic Ocean, 
presumably because of the strong seasonal cycle in that basin and the good distribution of Argo 
data late in the observing period. 




0.1 0.2 0.3 0.4 O.S 0.B 07 0.3 0.0 



10 


13 20 


Figure 20: The left-hand panels show the correlation of SLA from the GEOS analyses from 
1993 to 2011 with gridded SLA from A VISO. The upper plot is for the GEOS analysis that 
assimilates the SLA data; the lower plot is from the analysis that does not assimilate SLA 
data. The right-hand panels are the RMS difference (cm) between the analyses and the 

AVISO data. 




RM5G (cm}: GEOS OOA5S.1-AVIS0 


The quality of the analysis with altimeter data assimilated can also be evaluated by comparing 
with the independent data from tide gauges, with the caveat that many tide gauges are not 
representative of open ocean conditions. Figure 21 shows the difference between the RMS of 
analysis departures from tide gauge data - the RMS from the analysis with SLA data minus the 
RMS from the analysis without SLA data. The predominance of negative values, the blue-green 
colors, show the analysis with SLA data assimilated compares better with the tide gauge data than 
that with no SLA assimilation. The time series of differences from tide gauge data and from 
AVISO SLA are shown for a few gauges in the equatorial waveguide. The comparison shows that 
the differences from the tide gauge data are consistent with the differences between the tide 
gauges and the AVISO product. 
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Figure 21: RMS of OMF for tide gauge data (passive): analysis with SLA data minus 
analysis without SLA data. Negative values indicate that SLA assimilation has a positive 

impact. 
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6.3.2 Global sea level variations 

In this section, the impact of the assimilation on global sea level and water volume budget is 
presented. When integrated globally, the two constituents of the mean sea level are steric height, 
that can be further decomposed into halosteric and thermosteric height, and volume input from 
outside sources. In the case of the CM2 configuration, the outside source of water mass to the 
ocean is the imbalance between run-off water and atmospheric fresh water fluxes. The steric 
height anomaly r] ST is defined as 

p ST (x, y, t) [p(T(x, y, z, t),S(x, y, z, t)) - p(T 0 (x, y, z, t), S 0 (x, y, z, t))], 

P 0 J-H 

where p 0 ,T 0 , andS 0 are climatological values. It represents the contraction or expansion of 
seawater due to thermal and haline variability. The haline contribution, t] HST , is defined as 

1 f 0 

rj HST (x,y,t ) = [p(T 0 (x, y, z, t),S(x, y, z, t)) - p(T 0 (x,y,z, t),S 0 (x,y,z, t))]. 

P o J-H 

The thermal contribution, ri TST , is defined as 
1 f 0 

r] TST (x, y, t) = [p(T(x, y, z, t), S 0 (x, y, z, t)) - p(T 0 (x,y,z, t),S 0 (x,y,z, t))]. 

P o J-h 



Figure 22: Area-weighted average of the contributions (haline is dashed green and thermal 
is dashed black) to changes in steric height (solid black line) and sea-level anomalies (yellow 

line) in cm, referenced to 1993. 

Figure 22 shows the global area- weighted yearly mean steric height 77 57 , its haline and thermal 
components and the sea level, all referenced to 1993. The overall increase in sea level over this 
period is consistent with that inferred from altimeter and Argo data (Leuliette and Willis 2011). 
The major contribution to steric height changes during the period of the analysis is from the 
thermal component, rf ST . From the slope of the solid black line and the yellow line of Figure 22 
one can infer that the sea level rise from 1993 to 1999 is of both thermal and external origin (land 
ice melt and changes in precipitation, evaporation, and run-off). After 2004, the analyzed steric 
height change has a significant contribution from the halosteric component, r / HST . This change, 
which must arise from the “sudden” availability of new global information on the salinity 
distribution, is artificial and highlights the issues that can arise in inferring climate variability 
with a changing observing system. In contrast, Leuliette and Willis (2011) find that after 2005, 
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the sea level rise has a greater contribution from an increase in the ocean mass due to the input 
from melting glaciers and ice sheets than from thermosteric effects. 

6.4 Thermohaline circulation 

The Atlantic meridional overturning circulation (AMOC) is a major component of the 
thermohaline circulation and an indication of the state of the climate system since it quantifies the 
meridional transport of heat. Several of its constituents are depicted in Figure 23, showing the 
meridional transports at 26.5°N. Positive transports correspond to northward flow. The solid red 
line is the analysis estimate of the Florida Strait transport, while the dashed red line is the cable 
transport estimate. The analysis transport is slightly weaker than observed, possibly due to the 
relatively low resolution of the ocean model. The solid green line is the GEOS Ekman transport. 
The dashed green line is the Ekman transport estimated from the Cross Calibrated Multi-Platform 
(CCMP) wind product (see http://www.noc.soton.ac.uk/rapidmoc/) and is in reasonable 
agreement with the GEOS analysis. The mid-ocean transport, calculated as the vertical integral of 
the transport per unit depth down to the deepest northward velocity (-1100 m) on each day, closes 
the transport budget. The sold gold line is the GEOS ocean analysis estimate, which agrees well 
with the RAPID-MOC estimate (e.g., Rayner et al. 2011). The overturning transport is then the 
sum of the Florida Straits, Ekman, and upper mid-ocean transports and represents the maximum 
northward transport of upper-layer waters on each day. The mean GEOS ocean analysis estimate 
for the maximum AMOC is 14.0 Sv with a standard deviation of 3.2 Sv over the period of the 
analysis; this is weaker that the RAPID estimate of 17.4 Sv with a standard deviation of 4.9 Sv. It 
is likely that a major contribution to the weak estimate is the underestimation of precipitation or 
the overestimation of evaporation over the Atlantic between 5°N and 32°N, as inferred from 
Figure 24 which shows the overturning stream function is significantly weaker in this region. The 
total transport, calculated top to bottom and across the Atlantic section from Florida to Africa, is 
the black line of Figure 23; the apparent non-closure reflects the transport leakage through the 
Bering Strait and the fresh water flux from the forcing. 



Figure 23: Contributions to the AMOC transport (Sv) at 26.5°N. Black line: total transport 
from the GEOS ODAS5.2 analysis; solid red line: analysis estimate of Florida Strait 
transport; dashed red line: cable measurement of Florida Strait transport; solid green line: 
analysis Ekman transport; dashed green line: Ekman transport estimated using CCMP 
winds; solid gold line: analysis estimate of mid-ocean transport; dashed gold line: mid- 
ocean transport from the RAPID array. The blue lines show the maximum AMOC transport, 
with the solid line from the analysis and the dashed line from the RAPID array. 
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Figure 24: The Eulerian mean meridional overturning circulation in the Atlantic (in Sv). The 

contour interval is 2 Sv. 



Figure 25: Upper panel: Bottom topography in the subpolar North Atlantic, showing the 
Labrador Sea (LS), the Irminger Sea (IS), the Icelandic Basin (IB), and the Reykjanes Ridge 
(RR). Lower panels: The GEOS ODAS5.2 analysis of mean May-June salinity, 1995-1997, in 
the subpolar North Atlantic. The left panel shows salinity averaged over 36.92 < o 2 <36.94, 
the density range for LSW following Yashayaev (2007). The right panel shows salinity on the 
a 2 =37.00 kg m 3 surface associated with the NEADW. Salinity observations are over 
plotted on the GEOS ODAS5.2 analysis. 

One of the driving forces of the thermohaline circulation is the formation of deep cold water in 
the North Atlantic, commonly referred to as North Atlantic Deep Water (NADW). The most 


29 



prominent physical process for the formation of the NADW is evaporative cooling of surface 
water when heat is lost through latent cooling and salt concentration increases through 
evaporation. The resulting dense water sinks and forms the NADW. Another, less common, 
process for the formation of NADW is deep convective cooling, which only occurs in places of 
weak stratification, such as the Greenland and Labrador seas. The representation of Labrador Sea 
Water (LSW) in the GEOS ocean analyses is of particular importance because it is a principal 
contributor to the lower limb of the AMOC, which is of interest because of its potential 
predictability. 

Figure 25 shows the mean salinity over the subpolar North Atlantic for May-June 1995-1997 on 
isopycnal surfaces 36.92 < o 2 <36.94 and o 2 = 37.00 kg m" 3 . The former is associated with LSW 
(potential temperatures < 2.8°C and S < 34.84), typically seen at depths of 500-2000 m; the latter 
represents the high-salinity core of the Northeast Atlantic Deep Water (NEADW), typically seen 
at 2700-2900 m. During this period the Labrador Sea reached an extreme cold and fresh state and 
LSW filled the entire central part of the Labrador Sea basin from 500 to 2400 m (Yashayaev 
2007). 

The salinity analysis within the 36.92 < o 2 <36.94 kg m' 3 volume captures the very fresh water in 
the Labrador Sea basin, its advection around the southern limb of the subpolar gyre and also into 
the Deep Western Boundary Current. The freshest water is transported towards the Irminger Sea 
through the cyclonic recirculation gyre of the central Labrador Basin (Pickart and Spall 2007); 
however, the analysis is not as fresh as observed. 

The analysis at o 2 = 37.00 kg m' 3 is too fresh in the Iceland Basin, but it captures the NEADW 
fresh water plume that spreads around Reykjanes Ridge and into the Irminger Sea. The NEADW 
continues to freshen as it continues south along the western boundary of the Labrador Sea. The 
analysis also captures the strong salinity maximum of the Mediterranean water that spreads north 
the Iberian Peninsula. 

This direct comparison with the observations revealed that, through a technical glitch, not all of 
the observations in the Irminger Sea were assimilated. This problem has now been rectified in a 
new (ongoing) analysis. 

Deep convective cooling in the Labrador Sea was also observed in 2008 when observations were 
available from Argo. The resulting ocean structure is illustrated in Figure 26, which shows the 
horizontal distribution of temperature and salinity of the intermediate water (750 m). The major 
convection area (central Labrador Sea) is associated with the lowest temperatures and salinities, 
while the minor convection area (central Irminger Sea) has the highest values. The GEOS 
ODAS5.2 analysis captures both major and minor convection areas seen in the observations. 
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Figure 26: Salinity (left) and potential temperature (right) at 750 m, averaged from 
February to April 2008, in the subpolar North Atlantic. Observations ofT and S from Argo 
floats are over plotted on the GEOS ODAS analysis in the upper panels. 


6.5 Zonal currents along the equatorial Pacific 

Comparisons of the analyzed zonal currents with those from the Acoustic Doppler Current 
Profiler (ADCP) on the TAO moorings along the equatorial Pacific are shown in Figure 27. The 
climatologies are calculated from 1997 to 2005. 

Although no current data are assimilated in the system, the analyses represent the vertical 
structure and the annual cycle of the currents across the equatorial Pacific very well. The largest 
discrepancies are in the western Pacific where the Equatorial Undercurrent (EUC) core is too 
shallow, the analyzed zonal current is too diffuse below the core of the EUC and the intensity is 
too weak in the boreal winter. The RMS differences reach 0.4 m/s near the surface at 165°E 
primarily because the analyzed undercurrent does not surface in boreal spring as observed. The 
structure of the mean bias is similar at all mooring locations, having a westward flow that is too 
weak near the surface in the South Equatorial Current and eastward flow that is too weak in the 
EUC. The RMS error in the central and eastern Pacific is about 0.2 m/s. 
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Figure 27: The climatology of zonal currents (m/s) from ODAS5.2 and the ADCP on the 
TAO moorings on the equator. The climatology is calculated for the period of 1997-2005. 
The rightmost panels show the mean and RMS of OMFs for each mooring. 
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6.6 Water masses 

The assimilation procedure, using a sequential approach that has a weak constraint to the WOA09 
climatology prior to the Argo period and assimilates T and S profile information last in the cycle, 
appears to be effective in maintaining the observed water mass distributions. The T-S 
relationships are tighter in the Argo period; and prior to the Argo period, the analysis has a 
tendency to include slightly fresher waters than in the WOA09 climatology, especially in the 
Equatorial Pacific and Indian Oceans (Figures 28a and b). At higher latitudes (Figures 29 and 30), 
the analysis, well constrained by Argo, preserves the water masses in the WOA09 climatology 
and in the observations plotted in Talley (2008). 


1995: GEOS ODAS5.2 2011 : GEOS ODAS5.2 WOA09 




Salinity 

Figure 28a: The T-S pairs in the western equatorial Pacific, 165-1 7 0°E, 5°S-5°N, from a pre- 
Argo year (1995), a post-Argo year (2011) and the WOA09. Profiles are over laid on contours 
of (Tq (kg m 3 ). Monthly mean analysis output is subsampled to the same vertical grid as WOA09. 
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2011: GEOS ODAS5.2 
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Figure 28b: As for Figure 22 a, but for the equatorial Indian Ocean, 88.5-92.5°E, 0-5 °N. 



Figure 28c: As for Figure 22 a, but for the equatorial Atlantic Ocean, 37.5-33.5°W, 2.5°S-2.5°N. 
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Figure 29: The T-S pairs from profiles at 24°N, for zonal sections bracketing the Arctic in 
the Pacific sector, and for meridional sections through the northern subtropics and subpolar 
regions, following Talley (2008). Profiles are over laid on contours of a 0 (kg mi 3 ). Monthly 
mean analysis output is subsampled to the same vertical grid as WOA09. 
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Figure 30: As for Figure 29, except for profiles at 30°S, and for meridional sections 
southward from that section, following Talley (2008). 
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7 Summary 

We have presented the technical setup, the system evolution, and some results from the GEOS 
iODAS that uses MOM4 and is constrained by atmospheric forcing from MERRA. Comparisons 
of the upper ocean heat content from the ocean reanalysis with other ocean analyses (e.g., Xue et 
al. 2012) has helped to verify the quality of the analysis. Other, more comprehensive, 
comparisons are underway under the auspices of the Global Ocean Data Assimilation Experiment 
(GODAE) and the CLIVAR Global Synthesis and Observations Panel (GSOP). 

The Appendix presents an atlas of the 2002-2011 T and S climatologies from GEOS ODAS5.2 
and their comparison with WOA09. This analysis, also referred to as MERRA-Ocean, is currently 
being used to initialize an ensemble of CM1 models for the GMAO’s seasonal forecasting efforts. 
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AOGCM 
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CLImate VARiability and predictability 

CM 1(2) 
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Climate Prediction Center (NOAA) 
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Conductivity Temperature Depth 
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Defense Meteorological Satellite Program 

EN3 
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Ensemble Optimal Interpolation 

ESMF 
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GFDL 
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Global Modeling and Assimilation Office, Code 610.1, NASA/GSFC 
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NASA Modeling, Analysis, and Prediction (program) 
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Mean Dynamic Topography 
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Modem-Era Retrospective analysis for Research and Applications 

MOC 

Meridional Overturning Circulation 

MOM4 

Modular Ocean Model, version 4 

NADW 

North Atlantic Deep Water 

NASA 

National Aeronautics and Space Administration 

NCCS 

NASA Center for Climate Simulation (at NASA/GSFC) 

NEADW 

Northeast Atlantic Deep Water 

NMME 

National Multi-Model Ensemble 

NOAA 

National Oceanographic and Atmospheric Administration 

NSIDC 

National Snow and Ice Data Center 

ODAS 

Ocean Data Assimilation System 
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Tropical Atmosphere Ocean (moored buoy array) 

World Ocean Atlas 2009 
expendable Bathythermograph 


39 


Atlas of Climatologies of Selected Fields 

from 

GEOS ODAS5.2 
2002-2011 


40 


GEOS OOASS.2 WOA09- WO AM GEOS ODAS5.2 





t rq 


1 1.25 
LOO 
0.75 
0.50 
0.25 
0.00 
-0.25 
— D.SO 
-0.75 
- 1.00 
-1.25 
-LSO 
TfC] 


I 

§ 

U1 


S 

g 


I 



l 


35.70 

35.40 

35.00 
34-SO 
34.74 

34.70 
34.68 
34.66 
34 60 

34.40 
34.20 

34.00 

33.00 


5 



Figure 31: Potential temperature (°C) (left) and salinity (right) sections across the 
equatorial Pacific. The uppermost panels are from the GEOS ODAS5.2 analysis; the middle 
panels are from WOA09 and the bottom panels show the differences (ODAS minus WOA09). 
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Figure 32: As for Figure 31, but for the equatorial Atlantic Ocean. 
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Figure 33: As for Figure 31, but for the equatorial Indian Ocean. 
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Figure 35: As for Figure 31, but for the 30°S across the Pacific Ocean. 
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Figure 36: As for Figure 31, but for the 50°N across the Pacific Ocean. 
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Figure 37: As for Figure 31, but for the 50°S across the Pacific Ocean. 
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Figure 38: As for Figure 31, but for the 30°N across the Atlantic Ocean. 
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Figure 39: As for Figure 31, but for the 30°S across the Atlantic Ocean. 
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Figure 40: As for Figure 31, but for the 50°N across the Atlantic Ocean. 
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Figure 41: As for Figure 31, but for the 50°S across the Atlantic Ocean. 
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Figure 42: As for Figure 31, but for the 30°S across the Indian Ocean. 
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Figure 43: As for Figure 31, 


but for the 50°S across the Indian Ocean. 
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Figure 44: Potential temperature (°C) (upper set) and salinity (lower set) sections along 
165°E. The uppermost panel in each set is from the GEOS ODAS5.2 analysis; the middle 
panel is from WOA09 and the bottom panel shows the difference (ODAS minus WOA09). 
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Figure 45: As for Figure 44, but for 155°W. 
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Figure 46: As for Figure 44, but for 125°W. 
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Figure 47: As for Figure 44, but for 35°W. 
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Figure 48: As for Figure 44, but for 25°W. 


52 




123.0 


iHhfchi 

ico 

200 BRS 

av.oW J Jr~^ 

5oa ■ UUlU - — -2T /. 


'tea* 

1500 f 

4000 H 

4500^ ^ 1 1 

• 0 m ■ . | tmn-j! reTrm^i.,. ■» ■^_ 

ioo ™| 
i =5 

300^^1 n\ v?— l ~ 
JD0 r W\\ v ^^ : ^ : — 


::;KU| 1 1 

150C 

4ooo 

4500 ® ^ — P 

rr 

0 H= rq— j 

LOO | f ", h-s, 

200 c^S 'H i \N, 1 \ > 

300 1 ) IP/ \ ® 1 ^ 

400 1 « Iai < 1 1 

o Li 

srrfiri 
2000 \ 

2500 l ^®SHPjpi l 
3000 

3500 • \ 

:::t i_ r 

If 




20.0 

15.0 

10.0 


4.0 


3.0 


JO 


1.5 


1.0 



T| n C) 


1-25 

1-00 

0.75 

0,50 

0.25 

o.ao 

43-25 

■0-50 

- 0.75 

■ 1.00 

1.25 

-1.50 


T rq 



4500 

0 

100 

200 

300 

400 

500 

1000 
1500 
* 2000 
| 2500 
^ 3000 
3500 
4000 

4500 

a 

ICO 

20a 

300 

400 


g 1000 

I 1500 
^ 2000 
S 2500 

§ 3000 
O 

g 3500 
S 4000 
4500 



1 36.00 
35.70 
35.40 
35.00 
34.60 
S 34-74 
34 - 7 ° 
" 34.60 
I 34.66 
■ 34.60 

1 34.40 
34,20 

34.00 
33.80 

32.00 

31,00 



5 


Figure 49: As for Figure 44, but for 60°E. 
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Figure 50: As for Figure 44, but for 90°E. 
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Figure 51: Depth of the o 0 =25.6 kg m 3 potential density surface (top row), temperature on that surface 
(middle row), and salinity (bottom row). The leftmost panel in each set is from the GEOS ODAS5.2 analysis; 
the center panel is from WOA09 and the rightmost panel shows the difference (ODAS minus WOA09). 
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Figure 55: As for Figure 51, but for cj 1 =3 1.398 kg 
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